SemanticScuttle - klotz.me » klotz: retrieval augmented generation

klotz: retrieval augmented generation*

RAG isn’t dead, but context engineering is the new hotness

The article discusses the evolution from RAG (Retrieval-Augmented Generation) to 'context engineering' in the field of AI, particularly with the rise of agents. It explores how companies like Contextual AI are building platforms to manage context for AI agents and highlights the shift from prompt engineering to managing the entire context state.

2026-01-28 Tags: rag, context engineering, agents, llm, mcp, langchain, anthropic by klotz

AGENTS.md outperforms skills in our agent evals

Vercel's research shows that embedding a compressed 8KB docs index in AGENTS.md achieves a 100% pass rate for Next.js 16 API evaluations, while skills maxed out at 79%, even with explicit instructions. This suggests that passive context provision via AGENTS.md is more effective than active retrieval with skills for framework-specific knowledge in AI coding agents.

2026-01-28 Tags: coding, agents, agents.md, skills, next.js, rag, epistemology, vercel by klotz

LLMs create a new blind spot in observability

Logs, metrics, and traces aren't enough. AI apps require visibility into prompts and completions to track everything from security risks to hallucinations.

2026-01-25 Tags: llm, observability, metrics, logs, traces, prompts, hallucinations, rag, cybersecurity by klotz

Agentic File Search

An AI-powered document search agent that explores files like a human would — scanning, reasoning, and following cross-references. Unlike traditional RAG systems that rely on pre-computed embeddings, this agent dynamically navigates documents to find answers.

2026-01-17 Tags: llm, rag, document search, file exploration, gemini, llamaindex, python, document parsing by klotz

Kreuzberg - A Polyglot Document Intelligence Framework

A polyglot document intelligence framework with a Rust core that extracts text, metadata, and structured information from PDFs, Office documents, images, and 50+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, TypeScript (Node/Bun/Wasm/Deno) or use via CLI, REST API, or MCP server.

2026-01-11 Tags: document-intelligence, text-extraction, metadata-extraction, pdf-extraction, ocr, table-extraction, rust, python, ruby, java, go, php, elixir, typescript, wasm, tesseract, pdfium, rag by klotz

FailSafe: AI-Powered Fact-Checking System

FailSafe is an open-source, modular framework designed to automate the verification of textual claims. It employs a multi-stage pipeline that integrates Large Language Models (LLMs) with retrieval-augmented generation (RAG) techniques.

2026-01-08 Tags: python, knowledge-graph, celery, fact-checking, rag, automateion, vverification, chrome, llm, agents, amin7410 by klotz

MCP-powered RAG Over Complex Docs

A tutorial showing how to use the MCP framework with EyelevelAI's GroundX to build a Retrieval-Augmented Generation (RAG) system for complex documents, including setup of a local MCP server, creation of ingestion and search tools, and integration with the Cursor IDE.

2026-01-05 Tags: mcp, rag, groundx, eyelevelai, cursor ide, document retrieval, agents, fastmcp, documents by klotz

Building a 100% local MCP Client

This article details how to build a 100% local MCP (Model Context Protocol) client using LlamaIndex, Ollama, and LightningAI. It provides a code walkthrough and explanation of the process, including setting up an SQLite MCP server and a locally served LLM.

2026-01-04 Tags: mcp, llamaindex, ollama, llm, local, data science, python, sqlite, agent, rag, dailydoseofds by klotz

Smart Coding MCP

An extensible Model Context Protocol (MCP) server that provides intelligent semantic code search for AI assistants. Built with local AI models using Matryoshka Representation Learning (MRL) for flexible embedding dimensions.

2026-01-02 Tags: javascript, llm, mcp, ast, gemini, cursor, codex, rag, mrl, claude, local, antigravity by klotz

Hands-On AI Engineering

A curated repository of AI-powered applications and agentic systems showcasing practical use cases of Large Language Models (LLMs) from providers like Google, Anthropic, OpenAI, and self-hosted open-source models.

2026-01-02 Tags: llm, agents, rag, retrieval-augmented generation, python, github, sumanth077 by klotz

First / Previous / Next / Last / Page 1 of 0

SemanticScuttle - klotz.me

klotz: retrieval augmented generation*

Linked Tags

Related Tags